README file for data and programs associated with:

Takao Kato and Chad Sparber
"Quotas and Quality: The Effect of H-1B Visa Restrictions on the Pool of Prospective Undergraduate Students from Abroad"


**********************
SAT DATA ACQUISITION
**********************
RAW SAT DATA
The authors used data that is the property of the College Board (Source: Derived from data provided by the College Board. Copyright (c) 2000-2008 The College Board. www.collegeboard.com). Data was used in accordance with a licensing agreement between the researchers and the College Board. To request data, contact The College Board, 45 Columbus Avenue, New York, NY 10023-6992. Phone: (212) 713-8088. Website: http://research.collegeboard.org/data/request. The authors' point of contact at the College Board was Sherby Jean-Leger, Assistant Director, Research and Development.

The authors acquired customized SAT Public Use Data Files for each year from 2000 through 2008, with varying sample sizes, covering international students only. The data files contained SAT Program scores, demographic information, and SAT Questionnaire responses, with no personal identifying information. Appended to each file was a tier structure (described below) for the designated institutions to which students requested that their SAT scores be sent.

Note that the years listed above refer to the spring of an academic year (e.g., "2008" refers to academic year 2007/08). Also, the 2000 dataset was not used in the analysis. Each year of data cost $500 to acquire.


TIER STRUCTURE APPENDED TO DATASET
The authors provided information about college characteristics (e.g., school type, tier, region) that the College Board appended to its public use datasets in order to preserve the anonymity of institutions. Those characteristics are listed in REStatPublicFiles\CollegeTypes\Kato_Sparber_SchoolInfo.xlsx, with a guide to the codes available in REStatPublicFiles\CollegeTypes\CodeGuide.xlsx. This information would need to be supplied to the College Board before receiving the public use data.



**********************
FOLDERS AND FILES
**********************
The following folders contain Stata (version 12.0) files used for performing the data analysis. Note that the directory paths in each Stata do file will need to be changed before the analysis can be run on other computers.
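As an illustration of the path change described above, the following is a minimal sketch, not the authors' own setup. It assumes the package was unpacked to a hypothetical location (C:/Users/yourname/REStatPublicFiles) and that the hard-coded paths inside each do file have been edited by hand to match that location or to use the hypothetical global macro "root" defined here.

```stata
* Minimal sketch under stated assumptions; the location below is hypothetical.
version 12

* Define the package root once (replace with the folder on your computer).
global root "C:/Users/yourname/REStatPublicFiles"

* Move to the CleanedSAT folder and run the first program in the sequence.
* This only works after the paths inside CleanSAT.do have been updated.
cd "$root/CleanedSAT"
do "CleanSAT.do"
```

Forward slashes in paths are accepted by Stata on Windows as well as on Mac and Unix, so a single root definition of this kind can be reused across machines.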


*CleanedSAT
This folder contains the do file CleanSAT.do. This file puts data provided by the College Board into Stata format and organizes the data for subsequent analysis. This is the first program that needs to be run in order to analyze the data. Important details about case selection and data organization are described in the do file (CleanSAT.do) itself. 

*CollegeTypes
This folder contains information about individual schools that the College Board appended to its public use data before providing it to users. See "TIER STRUCTURE APPENDED TO DATASET" section above.

*Exogeneity
Programs and data in this folder were used to provide evidence that trends in the pre-binding period were uncorrelated with treatment/control effects.

*Fig1, Fig2, Fig3
These folders provide data and Stata programs for producing Figures 1 through 3 in the text.

*Macro
This folder contains IPUMS-provided data and user programs used to create measures of macroeconomic conditions in origin countries of immigrants. The data is used in the regressions of Table 6 in the text.

*Tables
This folder provides Stata do files for producing all regression estimates in Tables 1-10 of the paper. SAT regressions first require SAT data to be put in Stata format and organized as described in the "CleanedSAT" section above. Note that the Tables folder includes a subfolder "Tab10." This subfolder contains data and the Stata do file (Tab10_CaseStuyReg.do) necessary for performing the Case Study regression in Table 10.




